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Abstract 

In this paper we consider the problem of testing whether two samples of 
contaminated data, possibly paired, are from the same distribution. Is is 
assumed that the contaminations are additive noises with known moments 
of all orders. The test statistic is based on the polynomials moments of 
the difference between observations and noises. . A data driven selection 
is proposed to choose automatically the number of involved polynomials. 
We present a simulation study in order to investigate the power of the 
proposed test within discrete and continuous cases. A real-data example 
is presented to demonstrate the method. 

keyword contaminated data; data-driven; two sample test 



1 Introduction 

The classical two-sample problem concerning i.i.d. observations has been exten- 
sively studied in the literature. We propose in this paper to extend this problem 
to the case of two contaminated samples when a noise is added to each sample. 
More precisely, we consider two samples, Xi, • • • , X„ and Ui, ■ ■ ■ ,Uk, from the 
following two models 

X = Y + Z, and U = V + W, (1) 

where Y and Z (resp. V and W) are two independent random variables. It is 
also assumed that Z and W are independent. However this paper concerns inde- 
pendent as well as paired variables X and U since Y and V can be dependent. 
We keep this hypothesis through the paper putting n = k (the more general 
case being easily obtained). We assume that all moments of Z and W exist and 
are known. We are interested in testing the equality of the distribution of Y 
and V. Our aim is to construct an omnibus test for the general non parametric 
hypothesis 

Hq : Cy — against H\ : Cy ^ Cy, (2) 

where Cy and £y refers to the distribution of Y and V . For that we extend the 
one-sample smooth test inspired of Neyman (1937) (see also Rayner and Best, 
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1989, for a general introduction) to the two-sample case under dJ). For the 
one sample problem, the smooth test is an omnibus approach which consists in 
coming down to parametric hypotheses. Then the smooth statistic is composed 
of different elements each able to detect a departure from the null hypothesis. 
This approach can be naturally extended to the two sample case, as in Rayner 
and Best (2001) (see also Chervoneva and Iglewicz, 2005). In addition, Ledwina 
(1994) introduced a data driven procedure permitting to select automatically 
the number of elements of the statistic. The automatic selection is based on the 
Schwarz (1978) criterion. Janik-Wroblezca and Ledwina (2000) first used this 
technique combined with rank statistic for the two sample problem. Recently 
Ghattas et al. (2011) obtained a data driven test for the two paired sample 
problem. Various extensions of the data driven smooth test have been proposed, 
particularly in the context of survival data in Krauss (2009) when samples are 
right censored, or in the context of detection of changes in Antoch et al. (2008) 
reducing the problem to a two sample subproblem. 

From ([T]) it is clear that the unknown moments of Y (resp. V) can be 
expressed in terms of moments of X and Z (resp. U and W). The proposed 
smooth test is based on the difference between the k first moments of Y and 
V. The order k determines the number of components of the test statistic. We 
then adapt the data driven approach permitting to select automatically this 
number. We first consider the case where k varies between 1 and K, ior K a 
fixed integer. Then we let k tend to infinity more slowly than the sample size. 
For asymptotic results we make an assumption on the smallest eigenvalue of the 
sample covariance matrix. But in practice, the data driven procedure is effective 
in the first case with K fixed large enough, as shown in our simulations. Finally, 
we apply our method to the UEFA champion's league data from Meintanis 



Before describing our test procedure we offer a few examples that illustrate 
the situation ([1]). 

• Evaluation by experts. During an assessment, such as sensory analysis, it 
is very common that experts are biased in their judgments. This bias is 
commonly observed and assessed during training and can be assumed to 
be known in distribution. Typically, one can assume a normal distribu- 
tion with mean and variance associated with each expert. In this case, 
if we want to compare the distribution of two products evaluated by two 
experts, we are reduced to the situation ([Ij where X and U coincide with 
the two experts scoring, Z and W being their errors. 

• Ruin theory. Another situation that can be encountered in ruin theory is 
the random sum of claims, ^ e^, where Ci are i.i.d. random variables with 
known exponential distribution. The number of claims can be decomposed 
into a fixed known value, n, and a random value, N, representing an 
aggregation of different claims. Thus if we observe two sums 
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where and Vi are i.i.d., one problem is to compare the randomness 
structure Ni and N2, that is to test the equality of the distributions of 
these two variables. This problem coincides with ([1} since it is equivalent 
to testing the equality of the distributions of Y — ^ = 

• Mixture model. The deconvolution problem is also related to a mixture 
problem since a particular case of ([l} is the location mixture situation of 
the form 

fx {x) ^ j fvix- m)fz {dm) , fu {x) ^ J fv {x - m)fw (dm) , 

with m the location parameter, fy, fv the unknown mixed densities and 
fz-, fw tti6 known mixing densities. This situation can be encountered 
when finite mixture distributions have known components, and when the 
purpose is to compare their associated sub-populations associated with 
these components. We can also reverse the roles of Y, V and Z, W and be 
interested in the comparison of two linear mixed models with Gaussian 
noise and unknown random effects. 

• Extreme values. Contaminated model can be also viewed as a model for 
extremal values considering the convolutions 

X ^aY + Z, U = I3V + W, 

where a and /3 are Bernoulli with small parameter representing the occur- 
rence of an extreme event. Often the non-extreme distributions of Z and 
W are well observed and known and we can be interested in the compar- 
ison of the extreme distributions of Y and V. Assuming that one knows 
when these rare events occur, they are observed with a known noise as in 
©• 

• Scale mixture. Finally, it is current to observe the product of two variables, 
say 

X ^YZ, U ^ VW. 

For instance, that is the case for Zero Inflated distributions, when Y and 
V are Bernoulli random variables and Z and W are discrete random vari- 
ables. Without loss of generality, by translating all variables, we can use 
a log-transformation to recover ([l} . Many other cases can be envisaged as 
X = Z/Y andU ^ W/V with Z and W normally distributed. 

The paper is organized as follows. In Section 2 we introduce the method 
based on polynomial expansions for testing the equality of the two contaminated 
densities. In Section 3 we propose a simple data driven procedure that we 
extend to the case where the number of components of the statistic tends to 
infinity, with additional assumptions. In Section 4, finite-sample properties of 
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the proposed test statistics are examined through Monte Carlo simulations. The 
analysis of a the champion's league data set is provided in Section 5. Section 6 
contains a brief discussion. 



2 Statistical method 

Consider simultaneously two (possibly paired) samples Xi, - ■ ■ , X„ and Ui, - ■ ■ ,Un 
following ([1]) and such that all moments exist and characterize the associated 
distributions. Is is assumed that the moments of Z and W are known. From 
([T]) we have the following two expansions for all integer i 

i i 

E(X^) = ^Q,E(r^)z,_j, and E([/') = ^ c,jS(y^'K_j, (3) 

with Cy = - j)0): Zi-j = E(Z*-J), and w,-j = V.{W^~^). Write 

tti = E(y*) and bi = E(y*). The null hypothesis coincides with = 6^, Vi = 
1, 2, • • • , and our testing procedure reduces to the parametric testing problem: 
Vi = 1, • • • , fc, Oi — &i = 0, when k gets large. We shall let k tend to infinity, with 
a speed depending of the sample size, and its choice will be done automatically 
by a data driven method. Inverting ([3]) we get 

a,=nP^{X)) and = E(Q,([/)), (4) 

where Pi and Qi are polynomials of degree i. For instance the first three terms 
are 

Pi(x) = X — zi, 

P2{x) - .T2-2ziPi(a:)-Z2, 

Psix) ^ a;3-3ziP2(a;)-3z2Pi(a;)-Z3. 

To construct the test statistics we consider the vector of differences 

V,(k) = (P,(X,)-Q,(C/,))i<,<,, 

and we put 



Under i?Oi Jni}^) has mean zero and finite fc x fc variance-covariance matrix 

E(fc) =Eo(^i(fc)T/i(fc)'), 

where Eq denotes the expectation under i?o and Vi(fc) is the transposition of 
Vi(fc). Next, let us define the empirical version of under i/o, that is the 
k X k matrix 
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In the following, we assume that E„(fc) is a positive-definite matrix so that the 
corresponding inverse matrix and its square root exist. Note that this condition 
is satisfied a.s. for n large enough since the estimator is consistent. We consider 
the test statistic 

T„(fc) = J„(fc)'E„(fc)-V„(fc) = ||E„(fc)-i/V„(fc)||2, (5) 

where ||.|| denotes the euclidian norm on K*^. Application of the Central Limit 
Theorem shows that under Ho, Tn{k) converges in distribution to a random 
variable with k degrees of freedom as n tends to infinity. The strategy is to 
select an appropriate degree fc; that is, a correct number of components in the 
test statistics. In addition, observe that the null hypothesis can be rewritten 
as : 9 = where 6 = E(Vi(fc)). Suppose that the maximum likelihood 
estimator 9 oi 9 equals the empirical mean of the sample of the T4(fc)'s, that is 
9 = '^"^ , as it is the case for instance when the distribution of Vi(fc) belongs 
to an exponential family. Then, Tn{k) is the score statistic and the Schwarz 
criteria is well adapted to get an automatic selection of k. 

3 Data driven approach 

In this section, the data-driven method introduced by Ledwina (1994) (see also 
Inglot et al. 1997) is used to optimize the parameter k in our test statistic. 
It is based on a modified version of Schwarz's Bayesian information rule. The 
optimal value of fc, denoted by Sn, is such that 

5„ = min { argmax (T„(fc) - k log(n)) } , (6) 

l<k<d(n) 

where d{n) can be either fixed, equal to K, or increasing such that lim„^oo din) = 
oo. Once S'„ is determined, the test statistic is applied with k = Sn- More 
precisely, we use for our testing problem the statistic T„(S'„). Hereafter, the 
asymptotic distribution of the test statistic is derived under the null hypothesis 
for cases where d{n) is fixed or unbounded. 

3.1 The case where d{n) = K is fixed. 

Theorem 1 Assume that d{n) = K ^ I is fixed. Under Hq, when n tends to 
infinity, T„(S'„) converges in distribution to ay^ random variable with 1 degree 
of freedom. 

The proof is fairly standard and follows Ledwina (1994). We will detail a more 
general proof in the case where d{n) is unbounded (see Theorem [5]). 

Remark 1 In our simulations, we fixed K large enough, in the sense that its 
value was neither reached by Sn, either under the null (for empirical level cal- 
culations) or under alternatives (for empirical power calculations). 
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3.2 The case where d{n) is unbounded. 



Let us denote by Pq and Eq the probability and the expectation under Hq . Write 
'^min(^) the smallest eigenvalue of S„(A:). We now let d{n) tend to infinity under 
the following two conditions: 

(Al) d(n)VEo(Vin(rf(n))) = op„(log(n)). 

(A2) There exists some positive constant M such that for all fc > 0, 

k 

i=l ^ ^ 

where Zi = Pi{X)-Qi{U). 

Remark 2 The condition (Al) can be compared to results obtained in the frame- 
work of random matrices. For instance, Bai and Yin (1993) (see also Silver- 
stein, 1985, for the particular Gaussian case) considered the case where the 
entries Zij = Pi{Xj) — Qi{Xj) are independent and identically distributed with 
finite fourth moment (this moment condition may be compared with (A2)). They 
shown that almost surely \\m \^i^(d{n)) = 1 when d{n)/n — > 0. Then when the 
random series \min{d{n)) is bounded we g'etlimE(Amin(rf(n))) = 1 andd{n) can 
be chosen as opg (■\/log(n)). 

Assumption (A2) states that the fourth moment is bounded on average. It 
is similar to Assumption 2 stated in Ledoit and Wolf (2004)- More precisely, 
Ledoit and Wolf used a condition on the eighth moment which is somewhat more 
restrictive. 

Theorem 2 Let assumptions (Al) and (A2) hold. Thus, under Hq, T„(S'„) 
converges in distribution to a random variable with 1 degree of freedom. 

Proof The proof is partly inspired by Janic-Wroblewska and Lcdwina (2000). 
First note that the greatest eigenvalue of I]„(fc)~^ is the inverse of its smallest 
eigenvalue. Then we have |||S„(fc)^-^||| — 1/Amin(fc), where |||.||| stands for the 
spectral norm. Under iJo, it is clear that ^^(l) converges to a random 
variable with one degree of freedom. Then we have to prove that Po(<S'n = 1) 
tends to 1 as n tends to infinity, or equivalently that Po(5'„ > 2) tends to 0. Let 
us set a„(fc) = (fc — 1) logn. By definition of Sn, we have 

d{ri) d{n) , > 

Po(5„ > 2) = ^Po(5„ = fc) < ^Po(r„(A;)V2 > VM^)- (7) 

fc=2 fe=2 ^ ' 

Using the standard norm's inequalities for matrices and vectors we get 

Tu{k) = Jn{k)'%{k)-^Jn{k) < ||J„(A:)||2|||S„(A;)-i|||, (8) 
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that we combine with Markov inequahty to obtain 

Eo(||J„(fc)|||||S„(fc)-i|||V^) 



< 



< 



Eo(||J„(fc)|p)Eo(|||S„(ft)-i|||)"'^' 



1/2 



Using the independence of the pairs {Xs,Ys)i<s<m we get 

Eo(\\uk)\\A = Eo(-j2J2^M'Vtik) 

\ / \^ s=l t=l 

= l^J2^o(vs{kyvs{ki 

s—1 ^ 

= MWMm')- (9) 



We now remark that 



Eo(|lyi(fc)|p) = fc(i^Eo(Zf; 



i=l 



k ^ 1/2 

< fc(-^Eo(z2^' 

i=l 



^ i=l 



Finally, we have 



PoiSn >2)< sup 



l<k<din) VEo(An,in(fc))l/2y V VbgW 

Theorem [2] obtains as soon as we have shown that Eo(Aniin(fc))fe>o is a de- 
creasing sequence, which is clear since matrices (E(I]„(fc))fc>o are embedded by 
construction, that is, the k x k submatrix obtained from the k first lines and k 
first columns of S„(fc + 1) coincides (in distribution) with I]„(fc). 

Finally, the test procedure is consistent against any alternative having the 
form 

Hi{q) : 3g € N such that a,; = &i,Vi = 1, • • • , g — 1, and ^ bq, 
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where and bt are given by Q 

Proposition 1 Under Hi{q), T„(S'„) tends to infinity (in probability) as n ^ 

00. 

Proof First note that Um„_^oo d(n) > q. We now prove that hm„_>oo P{Sn < 
q) = 0. For k < q we have P(S'„ = fc) < P{Tn{k) > r„((7)). By the law of 
large numbers, the variable Jq/ \/n converges in probability to a non-null vector. 
Since E(g) is a positive definite matrix we have J'qTi{q)~^ Jq = 0-p{n) and the 
test statistics T„(g) — q\og(n) tends to +00 in probability under H[. By similar 
arguments, T„(fc) — fclog(n) tends to —00 under H[. Then P{Sn — k) ^ Q for 
all k < q. It follows that lim„_^oo P{Sn 5^ 9) = 1 and then r„(5„) tends to +00 
as n — ?► 00. 

Remark 3 It is well known that the sample covariance matrix performs poorly 
in the high dimensional setting. For applications in this context, we could change 
the sample covariance by a more suitable one. In Ledoit and Wolf (2004) o- 
linear shrinkage is proposed, E* = piI+p2'S, where I stands for the identity ma- 
trix and E for the sample covariance (like the one used in our paper) . Won et al. 
(2009) proposed a non linear shrinkage for Gaussian variance matrices. Writing 
the sample covariance matrix E = Qdiag{li, ■ ■ ■ ,lp)Q^ , their estimator has the 

form E* ~ Qdiag{Xi, • • • , \p)Q^ , where the A 's are constrained estimated eigen- 
values. Another approach is the thresholding procedure proposed in Cai and Liu 
(2011, see also El Karoui, 2008). Writing E — {(5ij)kxk the sample covariance 
matrix, a universal thresholding estimator is E* with a*j — aijl{aij ^ Ln}, 
with a proper choice of the threshold L„. Cai and Liu (2010) proposed the more 
adaptative thresholds Lij = 5{6ij\ogk/n)^/^ , with tuning parameter 5 and some 
fourth moment estimators 's. In our problem Oij should be the estimator of the 
variance ofVar{ZiZj). 

4 Numerical study 

Models and alternatives We present empirical powers of the test through 
several models. We will denote by V{m) the Poisson distribution with mean 
m, Af{a,b) the normal distribution with mean a and standard error 6, B{N,p) 
the binomial distribution on [0,A^] with probability of success p, xl ^^e chi- 
squared distribution with degree k. We consider four models under Hq and 
seven associated alternatives as follows: 

• Model MODI: Y ^ xl Z ^ 7V(0, 2), V ^ xl W ^ 7V(0, 0.1). 
Alternative All: V ^ xl^ W ^ 7V(0, 0.1). 

• Model M0D2: Y xl Z ^ 7V(0, 2), V ^ xl W ^ 7V(0, 1). 
Alternative A12: V ^ xl^ W ^ A/'(0, 1). 
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• Model MODS: Y ^ xl, Z ^ A^(0, 2),V ^xl,W ^ N{Q, 2). 
Alternative A13: V ^xl,W ^ 7V(0, 2). 

• Model M0D4: Y - 6(10, 0.5), Z - V{2), V - 6(10, 0.5), W - V{1). 
Alternative A21: V - 6(10,0.4) and W - V{1), 

Alternative A22: V ~ 6(10,0.6) and W ~ 7^(1), 
Alternative A23: V - 6(9,0.5) and - V{1), 
Alternative A24: V - 6(11,0.5) and W - V{1). 

For all models and alternatives we consider i.i.d. data (Xi, C/i), • • • , (X„, C/„) 
generated from two convolution models satisfying ([T]). It is assumed that Z and 
W have known distribution. 

Empirical levels We compute the test statistic based on a sample size n = 
30, 50, 100 and 200 for a theoretical level a — 5%. The empirical level of the 
test is defined as the percentage of rejection of the null hypothesis over 10000 
replications of the test statistic under the null hypothesis. We have fixed d{n) = 
10 arbitrarily large enough since the selected order does not exceed 4 in all our 
simulations. 

Empirical levels are reported in Table [T] for a fixed asymptotic level equal 
to 5%. It can be seen that all values are close to the asymptotic limit, also for 
small sample size. 

Table 1: Empirical levels for MODI, M0D2, MOD 3 and M0D4 with sample 
sizes 30, 50, 100, 200 



Model 


71 = 30 


n = 50 


n = 100 


n = 200 


MODI 


4.70 


5.07 


4.92 


4.90 


M0D2 


4.51 


4.98 


4.66 


5.01 


M0D3 


4.38 


4.72 


4.84 


4.94 


M0D4 


4.80 


4.93 


4.80 


4.63 



Empirical powers The empirical power of the test is defined as the percent- 
age of rejection of the null hypothesis over 10000 replications of the test statistic 
under Alternative. Empirical powers for alternatives A11-A13 are represented 
in Figure [T] In our knowledge, there is no equivalent method in the literature 
to compare contaminated distributions and then it is not possible to confront 
these powers. However, for alternative A13 Z and W have the same distribu- 
tion and the null hypothesis coincides with the equality of the two distributions 
Lx = ■ They consist in the convolution of a second order x^ distribution 
with a Gaussian distribution A/'(0, 2). Then, even if our method is not dedicated 
to the standard two-sample problem, we can compare its power with that of the 
classical Mann- Whitney test under A13. Figure [2] shows that these two tests 
have similar powers, with a slight advantage for the Mann- Whitney test with 



9 



large sample size. Note in Figure[3]that the alternative A13 is close to a transla- 
tion over the null distribution that can be advantageous for the Mann- Whitney 
test. 




50 75 100 125 150 175 200 



Sample size 




Figure 2: Empirical powers under alternatives A13 with the proposed method 
(♦) and with the Mann-Withney test (■^) with sample sizes 30, 50, 100, 200. 
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Figure 3: Density under the null model MODS with F - xl and Z - Af{0, 2) 
(left bold curve), and density under alternative A13 with V ^ xi and W ~ 
Af{0, 2) (right curve). 

Figure m presents the powers of the test for M0D4 with alternatives A21- 
A24. Both alternatives A21 and A22 are very well detected by the procedure. 
Under alternatives A23 and A24 the power is less good. These results are 
essentially due to the nearness between the distributions of Y and V and not in 
that between X and U. To illustrate this remark, Figure [5] shows the proximity 
between X and U at once for alternative A22 and for alternative A23. All 
distributions are very similar. But for alternative A22, the distributions of Y 
and V are closer than for alternative A23, explaining its better power. 
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Figure 4: Empirical powers for alternatives A21 (♦), A22 {if), A23 (■) and 

A24 (A) with sample sizes 30, 50, 100, 200. 
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(a) Distribution of X under M0D4 




(b) Distribution of U under alternative A22 (c) Distribution of U under alternative A23 

Figure 5: Distributions under M0D4 (a), alternative A22 (b) and alternative 
A23 (c) 

5 Illustration 

Table [2] reproduces paired data used by Meintanis (2007). It concerns matches 
of the UEFA Champion's League for the seasons 2004-05 and 2005-2006 where 
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there was at least one goal scored by the home team, and there was at least 
one goal scored directly from a kick by any team. The first variable X is the 
time (in minutes) of the first kick goal scored by either of the two team, and the 
second variable U is the time of the first goal of any type scored by the home 
team. 



Table 2: UEFA Champion's league data 



2005-2006 


X 


U 


2004-2005 


X 


U 


Lyon-Real Madrid 


26 


20 


Internazionale-Bremen 


34 


34 


Milan- Fenerbahce 


63 


18 


Real Madrid- Roma 


53 


39 


Chelsea- Anderlecht 


19 


19 


Man. United-Fernebahce 


54 


7 


Club Brugge- Juventus 


66 


85 


Bayern-Ajax 


51 


28 


Fenerbhace-PSV 


40 


40 


Moscow-PSG 


76 


64 


Internazionale-Rangers 


49 


49 


B ar celona- Shakht ar 


64 


15 


Panathinaikos-Bremen 


8 


8 


Leverkusen-Roma 


26 


48 


Aj ax- Arsenal 


69 


71 


Arsenal-Panathinaikos 


16 


16 


Man. United-Benfica 


39 


39 


Dynamo Kyiv-Real Madrid 


44 


13 


Real Madrid-Rosenborg 


82 


48 


Man. United-Sparta 


25 


14 


Villareal-Benfica 


72 


72 


Bayern-M. Tel-Aviv 


55 


11 


Juventus-Bayern 


66 


62 


Bremen-Internazionale 


49 


49 


Club Brugge-Rapid 


25 


9 


Anderlecht- Valencia 


24 


24 


Olympiacos-Lyon 


41 


3 


Panathinaikos-P S V 


44 


30 


Internazionale-Porto 


16 


75 


Arsenal- Rosenborg 


42 


3 


Shalke-PSV 


18 


18 


Liverpool-Olympiacos 


27 


47 


Barcelona-Bremen 


22 


14 


M. Tel-Aviv-Juventus 


28 


28 


Milan-Shalke 


42 


42 


Bremen-Panathinaikos 


2 


2 


Rapid- Juventus 


36 


52 









These data have been explored assuming a Marshall- Olkin distribution in 
Meinatis (2007) and with a bivariate generalized exponential distribution in 
Kundu and Gupta (2009). In Meintanis (2007) the conclusion was that the 
Champion's-League data may well have arisen from a Marshall- Olkin distribu- 
tion. In Kundu and Gupta (2009) the generalized exponential distribution can 
not be rejected for the marginals and the bivariate generalized exponential dis- 
tribution can be used for these data. We consider here another model through 
contaminated Poisson distributions. 

First model First we assume an additive noise 

X = Y + Z and U = V + W, (10) 

with Y and V having Poisson distributions and Z and W being dependent 
random noise with E(Z) = E(M^) = 0. This model can be viewed as a mixed 
model with Z and W as paired random effects. These effects can be considered 
as discrete or continuous as in Meintanis (2007) or Kundu and Gupta (2009). 
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We assume that Y and V have mean (estimated) 40.9 and 32.9. The observed 
variances are larger than the means thereby beheve there is a phenomenon of 
overdispersion. Obviously under pUj) we have Var{X) = Var{Y) + Var{Z) and 
Var{U) = Var{V) + Var{W). We apply our procedure to test the equality of 
the distributions of Z and W. 

Conclusion: The first statistic T(l) is retained and we obtain a p- value equal 
to 0.28. Hence there is no evidence that the two additive paired random effects 
differ. 

Second model We also consider a multiplicative noise yielding to the follow- 
ing scale mixture 

X = YZ and U = VW, (11) 

with Y and V having Poisson distributions and Z and W being real positive 
dependent random scale factor with E(Z) — K{W) = 1. Again this model can 
be viewed as a mixed model with random paired effects. The observed values 
are discretized but we can assume that Z and W are discrete or continuous. 
We assume that Y and V have mean (estimated) 40.9 and 32.9 and there is 
still a phenomenon of overdispersion assuming it is a standard Poisson model. 
Under ([TT]) the variances satisfy Var{X) = {2Var{Z) + l)E{X) and Var{U) = 
{2Var(W) + 1)E([/). Our purpose is to test Hq : Cz = Cw, or equivalently 
J~'\og{z) = ^iog(w)- For that wc consider the transformation of (fTTj) 

log(Z) ^ log(r) + log(Z) and \og{U) = \og{V) + \og{W). 

Conclusion: Using our method we obtain a p- value equal to 0.70. Again we see 
that the multiplicative paired random effects seem to have the same distribution. 

6 Discussion 

This paper discusses the problem of comparing two distributions contaminated 
by different noises. The test is very simple and allows to compare two indepen- 
dent as well as two paired contaminated samples. Simulation studies suggest 
that the proposed method works well with an empirical level close to that ex- 
pected. 

It may be noted that the test statistic is decomposed into moments of X, Z, U, 
and W . Then it is clear that only the knowledge of the moments of Z an W 
are required instead of their distributions. Hence the test could be adapted 
when these distributions are unknown, if their moments can be estimated from 
independent samples. 

Eventually, the multivariate case could be envisaged by using the following 
characteristic property: if Y and V are two random vectors taking values in R'^ 
then we have 

Ho-.Y ^'^V ^ y\\u\\ < 1, u'Y ^'^ u'V, 
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and clearly multidimensional observations can be transformed into unidimen- 
sional ones by applying a sequence of vectors m on X and U . For a fixed value 
of u the problem consists in an univariate test and the statistic T„(5„) can be 
used, denoting by T„(u) this statistic the process {T„(?i): ?i e (0, l)**} converges 
to a Gaussian process and a new test statistic can be envisaged by estimating 
the covariance operator of the process to get a null distribution. In practice 
the sequences of vectors u can be randomly chosen in (0, 1)'', but it can also be 
done by a Quasi Monte Carlo method (see for instance L'Ecuyer, 2006). 

To conclude, the multisample case can also be envisaged as follows: assume 
that we have d convolutions simultaneously 

X{i)=Y{i) + Z{i), i = l,---,d 

observed from d samples. Write aj{i) = K(Y{iy) and oij ~ ^SiLiQ:j(*) the 
common value under the null hypothesis Hq : jOry^i) = ■■■ = Cyid)- Then 
under Hq the kx d vector D with components Dij = aj{i) — aj is centered and 
normally distributed. An adaptation of the data driven smooth test seems then 
possible. 
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